首页> 外文OA文献 >Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer

【2h】

Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer

机译：学习多任务和多机器人的模块化神经网络策略传递

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Reinforcement learning (RL) can automate a wide variety of robotic skills,but learning each new skill requires considerable real-world data collectionand manual representation engineering to design policy classes or features.Using deep reinforcement learning to train general purpose neural networkpolicies alleviates some of the burden of manual representation engineering byusing expressive policy classes, but exacerbates the challenge of datacollection, since such methods tend to be less efficient than RL withlow-dimensional, hand-designed representations. Transfer learning can mitigatethis problem by enabling us to transfer information from one skill to anotherand even from one robot to another. We show that neural network policies can bedecomposed into "task-specific" and "robot-specific" modules, where thetask-specific modules are shared across robots, and the robot-specific modulesare shared across all tasks on that robot. This allows for sharing taskinformation, such as perception, between robots and sharing robot information,such as dynamics and kinematics, between tasks. We exploit this decompositionto train mix-and-match modules that can solve new robot-task combinations thatwere not seen during training. Using a novel neural network architecture, wedemonstrate the effectiveness of our transfer method for enabling zero-shotgeneralization with a variety of robots and tasks in simulation for both visualand non-visual tasks.

机译：强化学习（RL）可以使各种各样的机器人技能自动化，但是学习每种新技能都需要大量的现实世界数据收集和手动表示工程来设计策略类或功能。使用深度强化学习来训练通用神经网络策略可以缓解一些通过使用表达策略类来增加人工表示工程的负担，但由于与低维，手工设计的表示相比，这种方法的效率往往比RL低，因此加剧了数据收集的挑战。转移学习可以使我们将信息从一种技能转移到另一种技能，甚至从一种机器人转移到另一种机器人，从而减轻了这个问题。我们展示了神经网络策略可以分解为“特定于任务”和“特定于机器人”的模块，其中特定于任务的模块在机器人之间共享，特定于机器人的模块在该机器人上的所有任务之间共享。这允许在机器人之间共享任务信息（例如感知），并在任务之间共享机器人信息（例如动力学和运动学）。我们利用这种分解来训练混合搭配模块，这些模块可以解决训练过程中未发现的新机器人任务组合。使用新颖的神经网络体系结构，我们证明了我们的传输方法的有效性，该方法可使用多种机器人和模拟任务在视觉和非视觉任务中实现零射击通用化。

著录项

作者
Devin, Coline; Gupta, Abhishek; Darrell, Trevor; Abbeel, Pieter; Levine, Sergey;
展开▼
作者单位

展开▼
年度 2016
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. Evolutionary Multi-task Learning for Modular Knowledge Representation in Neural Networks [J] . Rohitash Chandra, Abhishek Gupta, Yew-Soon Ong, Neural processing letters . 2018,第3期

机译：神经网络中模块化知识表示的进化多任务学习
2. Multi-task transfer learning deep convolutional neural network: application to computer-aided diagnosis of breast cancer on mammograms [J] . Samala Ravi K., Chan Heang-Ping, Hadjiiski Lubomir M., Physics in medicine and biology. . 2017,第23期

机译：多任务转移学习深卷积神经网络：应用于乳房X线图乳腺癌的计算机辅助诊断
3. Learning programs is better than learning dynamics: A programmable neural network hierarchical architecture in a multi-task scenario [J] . Donnarumma Francesco, Prevete Roberto, de Giorgio Andrea, Adaptive Behavior . 2016,第1期

机译：学习程序胜于学习动力学：多任务场景中的可编程神经网络分层体系结构
4. Learning modular neural network policies for multi-task and multi-robot transfer [C] . Coline Devin, Abhishek Gupta, Trevor Darrell, IEEE International Conference on Robotics and Automation . 2017

机译：学习用于多任务和多机器人传输的模块化神经网络策略
5. Multi-task learning deep neural networks for automatic speech recognition [D] . Chen, Dongpeng. 2015

机译：多任务学习深度神经网络自动语音识别
6. Multi-task transfer learning deep convolutional neural network: Application to computer-aided diagnosis of breast cancer on mammograms [O] . Ravi K Samala, Heang-Ping Chan, Lubomir M Hadjiiski, -1

机译：多任务转移学习深度卷积神经网络：在乳腺钼靶计算机辅助诊断中的应用
7. Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer [O] . Devin, Coline, Gupta, Abhishek, Darrell, Trevor, 2016

机译：学习多任务和多机器人的模块化神经网络策略传递

Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅